Composite Group-Keys - Space-Efficient Indexing of Multiple Columns for Compressed In-Memory Column Stores
نویسندگان
چکیده
Real world applications make heavy use of composite keys to reference entities. Indices over multiple columns are therefore mandatory to achieve response time goals of applications. We describe and evaluate the Composite Group-Key Index for fast tuple retrieval via composite keys from the compressed partition of in-memory column-stores with a main/delta architecture. Composite Group-Keys work directly on the dictionary-encoded columns. Multiple values are encoded in a native integer and extended by an inverted index. The proposed index offers similar lookup performance as alternative approaches, but reduces the storage requirements significantly. For our analyzed dataset of an enterprise application the index can reduce the storage footprint compared to B+Trees by 70 percent. We give a detailed study of the lookup performance for a variable number of attributes and show that the index can be created efficiently by working directly on the dictionary-compressed data.
منابع مشابه
OPTIMAL PERFORMANCE-BASED SEISMIC DESIGN OF COMPOSITE BUILDING FRAMES WITH RC COLUMNS AND STEEL BEAMS
Composite RCS building frames integrate reinforced concrete columns with structural steel beams to provide an efficient solution for the design and construction of earthquake-resisting structures. In this paper, an optimization framework is developed for performance-based seismic design of planar RCS moment resisting frames. The objective functions are defined as minimizing the construction cos...
متن کاملCache Conscious Column Organization in In-Memory Column Stores
Cost models are an essential part of database systems, as they are the basis of query performance optimization. Based on predictions made by cost models, the fastest query execution plan can be chosen and executed or algorithms can be tuned and optimized. In-memory databases shift the focus from disk to main memory accesses and CPU costs, compared to disk based systems where input and output co...
متن کاملFast Column Scans: Paged Indices for In-Memory Column Stores
Commodity hardware is available in configurations with huge amounts of main memory and it is viable to keep large databases of enterprises in the RAM of one or a few machines. Additionally, a reunification of transactional and analytical systems has been proposed to enable operational reporting on the most recent data. In-memory column stores appeared in academia and industry as a solution to h...
متن کاملAdvancing the discovery of unique column combinations (Technische Berichte des Hasso-Plattner-Instituts für Softwaresystemtechnik ; 51)
Unique column combinations of a relational database table are sets of columns that contain only unique values. Discovering such combinations is a fundamental research problem and has many different data management and knowledge discovery applications. Existing discovery algorithms are either brute force or have a high memory load and can thus be applied only to small datasets or samples. In thi...
متن کاملHiding Approach for Efficient Image Indexing
In this letter, we propose a data hiding approach to embed the indexing keys inside the JPEG compressed images for their retrieval, inspired from the spirit of digital watermarking. At the stage of database population, the existing approach is to compress all images by standard JPEG in order to save their storage space, but ignored the compression of their indexing keys under the notion that th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014